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© Instruction decoding microengines. 



© A pipeline architecture computer (24) has two 
interconnected microengines (53,60) operating si- 
multaneously in an instruction preparation unit (10). 
Each microengine (58,60) operates on the same 
portion (53) or different portions (53,55) of an in- 
struction concurrently within the same clock cycle. 
To assure coordination, one microengine (60) serves 
as the executive or attribute generator and the other 
the main microengine (58). The main microengine 
(58) generates routines to control the instruction 
preparation unit (10). The executive microengine 
(60) prepares the next task in parallel and supplies 
arguments to the main microengine (58) as the main 
microengine (58) generates its current routine. At the 
completion of a routine performed by the main 
microengine (58), it signals the executive microen- 
gine (60) to obtain the next task and associated 
£jj parameters. The instruction preparation unit (10) thus 
^translates variable length instruction word (53,55) 
CO from an instruction memory (16) of the pipeline 
^architecture computer (24) into a local computer 
^instruction (68) in the form of elementary opcode 
^- operations and final address values for data fetching 
and operation execution. 
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Background of ft* jnventar 



This invention relates to general purpose com- 

3S £*T2££ in a pipeHned computer 

processing of information in a computer at 
hioh^ has been accomplished in a number of 
high ratss naa u . ern j tte r coupled 

rS-S^Sen" used to obtain high doc, 
109 IrSar applications. Parallel processors 
f 6 ! ^nTsed'o perform high speed processing • 
have been useo to v multiple microen- 

^fZSXJ^^ me partial result 00 

does it * 9 f*o°**ng t»» «* 

a starting point address. 
Summary, of the Invention 



,n accordance with the present invention an 



nPCQOlNG WPCHQ ENGINES 

fi&! d and a - •^3^^ 

first microengine means c°"P>^ « v ^ able 
me ans decodes an opcode po of ^ ^ 

an .nput of the ^ -c 9 tQ generate a 

tor means P erforms compu t e r instruction. 

TS second portion of *» local cornp ^ 

^ meanS 
outputs of the first and s ^ ^ variable 

and to the ^0 ^^ microengine primarily 
length instruction. The si instruct jon and 

20 generates routines to J™*^^ as an 
the second microengme prima ^ -p 

~, w jrsxs^^ te first 

« routines. Each microengme comprises a 

demeans and a r^er -eans^ ^ ^ 

ln accordance w * a a 

invention, a method is proviue computer 

^ bte Socro^ 

30 system into a Jtocal c«r p jn 
the steps of storing | «" p repa ration unit, 
a register means of an |ength in- 

decoding a first ^^.^^Uth 
action and a second pontt * ^ a ^ 

coupled to an input of tt» first rrac y 

piuraiity of ^/^SLc^** of the vari- 
gine means and to the secona p 
able length instruction , m the regster ^ 
having an output coupled to an .np ^ ^ 

50 microengine i^^^Z^ comprises 
portion of the viable length mst ^ 
decoding an opcode fie w a cQm _ 
length instruction having the secona y, 
p Ses decoding at least one spscfter field. 
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Brief Description of the Drawings 

Other ami further features and advantages of 
the invention- wil become apparent in connection 
with the a ccompa nying drawings wherem: 

FIG. 1 is a block diagram of an instruction 
preparation unit embodying the invention; 

FIG. 2 is a block diagram of a pipelined 
computer system showing the instruction prepara- 
tion unit of FIG. 1 interconnected within the system; 

RG. 3A shows the variable length instruction 
format having an opcode field and up to N operand 
specifier fields for decoding by the instruction 
preparation unit of RG. 1 ; 

FIG. 3B shows an instruction format for a 
two operand longword ADO instruction; 

FIG. 4 shows the local computer instruction 
(LCI) output format produced by the instruction 
preparation unit. 

FIG. 5 shows an instruction format for a push 
longword source operand onto stack (PSHLG) in- 
struction; 

FIG. 6 is a microcode flowchart for the 
PSHLG instruction showing the parallel operation of 
the microengine and the attribute generator during 
the instruction execution. 



Description of the Preferred Embodiments 

Referring now to FIGS. 1 and 2, there is shown 
a block diagram of an instruction preparation unit 
10 and a pipelined computer system 24 comprising 
the instruction preparation unit 10. The instruction 
preparation unit 10 comprises a microengine 58 
and an executive microengine or attribute generator 
60 which are interconnected and operating in par- 
allel on a variable length instruction 51 input stored 
in instruction register 50 to convert the variable 
length instruction 51 into a local computer instruc- 
tion 68. The microengines 58 and 60 are cross- 
coupled with the outputs of each one conditionally 
controlling and defining the inputs of the other one. 
A microengine 58, 60 provides the signals of the 
lowest control level for operating the computer sys- 
tem. The variable length instruction 51 as shown in 
FIG. 3A comprises an opcode field 53 and at least 
one specifier field 55 which are processed in par- 
allel because each microengine 58 and 60 operates 
on different portions or fields of the instruction at 
the same time. However, some instructions in the 
computer system 24 may have only an opcode and 
no specifier in which case both microengines 58 
and 60 operate on the opcode. Both microengines 
58 and 60 must be tightly coupled or intercon- 
nected so that each continually knows where the 
other had started, what the other is doing and what 
output is expected from each microengine 58, 60. 



As a result, a decision must be made at the start of 
each instruction decoding as to what portion of the 
processing load is to be accomplished by each 
microengine .58, 60. Also, since the portions of an 

5 instruction are interdependent on each other (e.g. 
specifier evaluation lengths for an opcode), param- 
eters must be requested and passed between 
microengines 58, 60 as they progress through their 
processing sequences. Finally the two microen- 

ro gines 58, 60 in concert produce the required local 
computer instruction (LCI) 68 output. An instruction 
stream is provided to the IPU 10 by an instruction 
prefetch control 70 which causes the next instruc- 
tion memory field to be fetched so that while the 

75 cunent variable length instruction 51 is being pro- 
cessed, the next variable length instruction field is 
waiting to be loaded into the instruction register 50. 

Referring to FIG. 1 the input to microengine 58 
is controlled by multiplexer 54 along with logic 

20 selector 63 which determines the source of the 
input to microengine 58. There may be eight sour- 
ces of inputs to the multiplexer 54 each of which is 
selected by the 3-bit output to the multiplexer 54 
from the logic selector 63. Such sources provide 

25 addresses to a control store 57 memory <8K words 
by 109 bits) in the microengine 58 and the control 
store 57 output is stored in a register 59. The 
addresses are provided by control sources such as 
a next microengine address 76 from attribute gen- 

30 erator 60, a next address 96 from microengine 58, 
an opcode from the instruction register 50 and a 
decoded specifier from specifier decoder 52. An- 
other input source is provided by the calculator 64 
on line 65 which generates results in response to 

35 data from the instruction register 50 specifier fields, 
from the calculator arguments and control lines 92, 
from the microengine 58 and from the calculator 
arguments 86 generated by the attribute generator 
60. Output lines 67 from the calculator 64 are fed 

4d to the LCI multiplexers 66 along with the LCI ar- 
guments 84 outputs from the attribute generator 60 
which together form a 58 bit local computer in- 
struction (LCI) 68 in accordance with control sig- 
nals received from the microengine 58. The format 

45 of the LCI output word comprising 6 fields is shown 
in FIG. 4. 

Still referring to FIG. 1 . The input to the execu- 
tive microengine or attribute generator 60 is con- 
trolled by multiplexer 56 and may be an opcode 53 

so from the instruction register 50 or a next address 
80 generated by the attribute generator 60. One of 
these inputs is fed to a control store 61 memory 
(4K words by 59 bits) and the output of the control 
store 61 is stored in a register 62 in the attribute 

55 generator 60. The ability of the attribute generator 
60 to feed back to its input an AG next address 80 
allows sequencing of the attribute generator 60 
through a program in its control store memory 61 
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essentia c ^Jl^ach conserves microprc, 
varying tang** m ^ s P ^ e attribute generator 60 
. gram address logons. tQ the multiplexer 56 
provides a ^f^o input address sources 
for selecting one of the ™ addreS s 76 or 

it also generates a m oeng j n e control s.g- 
'mi oengine 58 «K P^^SSb* B—-* 
Tils 78 Another output «w ^ to the 

SSSlr 66 * *^J?£n* gen- 
action 68 in ^"'^-"^ calculator 6*. 
Sd by the microengme 58 £d a b(ocK 

^ herring "^J 0 system 24 corn- 

diagram of a P'P^ Ration unit 10- The 
prising the instra^on rep obtaine d from 
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tion is requ-red. Howev *e results *e 
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instruction p«P«* ™ instr uction data bu * 
bits of information over ffi addre ssed via 

vJnen the instruction memory ^ 
£ computer over the computer 
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between the ^ d arithmetic manipu' 
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one's complement. s»gn J" three arithmetic 
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point calculations. an example of *ie 

Referring now to ^ micro engine 58 and 
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25 SSL generator 60 for ^ ^ ^ 

wo operand longword (32 » [(jngword A oO 

J,s ft. i"*"* 00 ft SJoWO- STORE, etc. 

The output of the irvj ^ computer 
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instruction (LCI) 68. * na _ a l0 . blt ex 
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FIG. 3B. The IPU TO decodes the- opcode plus the 
next two operand specifiers as required in one 
rrricrocycie and generates a specifier evaluated in- 
struction for the pipeline in the rroet rrricrocycie for 
further processing: by the system. This requires the 
IPU 10 decoding logic to examine the contents of 
one to seven bytes of an instruction in one micro- 
cycle in order to produce a calculated output in the 
next microcycle. The two microengines 58 and 60 
each operate on a different portion of the instruc- 
tion at the same time. Rhi :fi microengine knows 
where the other one started, what the other one is 
doing and what output is expected from each 
microengine. Hence, a decision is made at the start 
of each instruction as to what portion of the pro- 
cessing load is to be accomplished by each 
microengine. Since portions of the instruction are 
interdependent on each other (e.g., specifier evalu- 
ation lengths on the opcode), parameters must be 
requested and passed between microengines 58 
and 60 as they progress through their processing 
sequences to produce the required output. One of 
the microengines, the attribute generator 60, dis- 
patches tasks and parameters to the main microen- 
gine 58 which generates routines to control the IPU 
TO. The main microe ngine 58 performs a routine 
using parameters passed to it from the executive 
micro engine 60 to produce the required LCt 68 
output for the processing pipeline. The executive 
function for the n&xt routine is formed in* parallel 
with the present routine being performed by 
microengine 58. At the completion of a routine, the 
microengine 58 signais the executive or attribute 
generator 60 to obtain the next task command to 
process aiong with any parameters associated with 
that task. 

An opcode 53 is examined at the start of an 
instruction and the entry point into the executive 
microengine 60 is determined. Since this executive 
microengine 60 produces attributes for the main 
microengine 58 which are a function of the instruc- 
tion, such as data type, data length and index 
multiplier, it is referred to as the attribute generator 
60. The main microengine 58 uses as an entry 
point, either the opcode 53 or the specifiers 55 
which is determined by the opcode 53. If the 
instruction has no associated specifiers or work is 
done before the first specifier is evaluated, entry is 
by opcode; otherwise, it is by specifier(s). On 
specifier entry, if the instruction has only one 
specifier or if the results of evaluating two specifi- 
ers can not be packed into a single LCI 68, the 
specifier following the opcode is evaluated. Finally 
on specifier entry if the evaluation of the first two 
specifiers can be packed into one LCI 68 the two 



specifier values will determine the entry point After 
initial evaluation of the specifier(s) or opcode, eval- 
uation of specifiers occurs on a single specifier 
basis as required. 

s fn the selected instruction format specifics 55 

do not uniquely identify the operation, but reiy on 
the opcode 53 for modifying attributes. An example 
of this is an ADO word with a register-indirect index 
specifier. Here, an index modifier, 2, and the length 

to of the data to be operated on are determined by 
the opcode. The specifier evaluation routines in the 
microengine 58 can be considered as a set of 
generic routines for ail instructions. This being the 
case, the attribute generator 60 must supply the 

T5 required arguements for the microengine 58 and 
the LCI 68 output as a function of the opcode. 

The attributes required by the calculator 64 
vary on the basis of both the opcode and the 
relative specifier position. In addition, the same 

20 parameters may require variations during the 
microengine 58 routines required to compile pro- 
cessing the instruction. The attribute generator pro- 
duces the following microengine control 78 outputs 
for use by the microengine 58 for its calculations. 

25 1. Shift code required for index multiplica- 

tion; 

2. Sign extend control for byte and word 
fengths; 

3. Floating point control; 

30 4. Align immediate data contra* far second 

longword of a quadword (64 bits). 

The attribute generator 60 also produces a 
memory access type argument for al! specifiers for 
exception detection. For example, a memory write 

35 access type argument is required for the second 
specifier in FiG. 3B and is produced by the at- 
tribute generator 60. If the microengine 58 detects 
an immediate operation in its specifier evaluation 
(i.e., write to an immediate), an exception routine is 

40 entered on the basis of the access type. An addi- 
tional output is provided denoting the next specifier 
length when known only by the attribute generator 
60. This is the case for instructions such as branch 
instructions which have no explicit specifiers but 

45 are rather followed by a byte displacement. The 
next specifier length is used to increment the pro- 
gram counter. 

When the microengine 58 has completed a 
generic specifier evaluation, it must determine 

so where to vector to next in the sequence of events. 
This is analogous to the return linkage at the end of 
a subroutine. Hence, it becomes evident that a 
• micro-subroutine stack is not required. Another way 
of interpreting the vector is to consider it as the 

55 next operation to process in the sequence of 
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events as ^^Ca^ribute generator 60 
mand as it comes ftomi w* ^ follown g 

ma5f vector the rn.croeng.ne 
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^^Snextgene.cinstru^ 

specifier in the specifier in 

3 . Evaluate ^e ^ gpedfied by 

the microengine 58 from * 
theaW nX^roen 9 - 53 MP - — - 

end of the present mjucton^o when 
of opcode or spec.Se r entry ed R may be 
follow-on specter evaiuat^ £ ^ ^ a 
invoked by one of w ° v ^' a speC ial specifier 
generic specifier evaluation or a p & ^ 

evaluation entry P^*^,, specifiers dif- 
point allows for evaluating Mow ^ ^ 
ferently. ^vonng specrfi c V 9 instruc tions. 
generated by compters for f()r ^ 

Tnis process saves o™°^ up ^ instr uc- 
most likely specifiers, tnus s P 
8on evaluation. ^^^^ 58 reaches a decision 
When the rn.croeng.ne direct 

point in •^.-JSrSS. Processing 
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gine 58 control store 5/_ tne microengine 58 
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microcode in the ^J 1 ^^ tne microengine 
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processed, * f,e ^ uC 7 d by microengine 58 for 
(OHU) opcode is P^^Jthe attribute gener- 
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ator for generic specifiers dernanded by the 
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PSHLGMO. The attribute generator 60 enters the 
instruction by opcode at routine PSHLGAGO. The 
microengine 58 evaluates the generic specifier and 
places the immediate or final memory referenced 
address in field 6F and the operand type in field 5 of 
the LCI 58. It also enables the- other fields, as 
required, from the attribute generator 60. For im- 
mediate values, the microengine 58 provides the 
OHU 12 opcode to pass unchanged to the EU 14. 
For memory references, the attribute generator 60 
provides the read longword opcode for the OHU 12 
under control of the microengine 58. The attribute 
generator 60 provides the additional arguments to 
form the remander of tha LCL 65. It produces the 
EU 14 opcode of "move to temporary 1 ' for field 1, 
the OHU 1 4 opcode of "read langword" for field 2 
and the EU t4- "temporary register number" for 
field 4 of the LCI 68. The attribute generator 60 
also generates the address vector used by the 
microengine 58 in the next cycle (PSHLGM1) and 
the control code to vector the microengine 58 to 
that address. In the last cycle of the generic speci- 
fier evalution associated with PSHLGMO, the 
microengine 58 enables the output LCI 68, selects 
the attribute generator 60 control code and address 
vector for the next address- It also increments the 
attribute generator 60 for the next cycle 
(PSHLGAG1) after use of the aforementioned ad- 
dress vector. 

In the next cycle the work of supplying the LCI 
68 which decrements the stack: pointer and stores 
the longword on the stack is done exclusively in 
the microengine 58 at PSHLGM1. The attribute 
generator 60 at address PSHLGAG1 supplies the 
output required to increment the trail sequence 
number, field 3, for the next LCI 68 of the new 
instruction. Both microengines 58 and 60 also set 
up their next microaddress for evaluating a new 
instruction. This example of the PSHLG instruction 
demonstrates the "tightly coupled" operation of 
microengines 58 and 60 and their parallel opera- 
tion, thereby speeding up the instruction execution 
time. 

This concludes the description of the preferred 
embodiment. However, many modifications and al- 
terations will be obvious to one of ordinary skill in 
the art without departing from the spirit and scope 
of the inventive concept For example the size of 
the control store 57, 61 in each microengine 58, 60 
is directly related to the particular variable length 
instructions 51 being converted to local computer 
instructions 68. Therefore, it is intended that the 
scope of this invention be limited only by the 
appended claims. 



Claims 

1 . An instruction preparation unit of a computer 
system comprising: 

s register means (50) for storing a variable length 
instruction; 

first microengine means (58) coupled to said regis- 
ter means (50) for decoding a first portion (53) of 
said instructions and decoding a second portion 
70 (55) for said instruction having said second portion 
(55); 

second microengine means (60), coupled to said 
register means (50) and having an input coupled to 
an output of said first microengine means (58) and 

T5 an output (76,78) coupled to an input of said first 
microengine means (58), for decoding said first 
portion (53) of said instruction to provide a first 
portion (68) of a local computer instruction; and 
calculator means (64), having inputs coupled to a 

20 plurality of outputs of said first and second 
microengine means (58,60) and to said second 
portion (55) of said instruction in said register 
means (50) and having an output (65) coupled to 
an input of said first microengine means (58), for 

25 performing calculations to generate a second por- 
tion of said local computer instruction (68). 

2. An instruction preparation unit according to 
claim 1, characterised in that: 

said first portion (53) of the instruction comprises 
30 an opcode field and for said instruction having said 
second portion, said second portion (55) comprises 
at least one specifier field. 

3. An instruction preparation unit according to 
claim 1 , characterised in that: 

35 said first microengine (58) and said second 
microengine (60) are interconnected whereby said 
output of said first microengine (58) is coupled to 
said input of said second microengine (60) and 
said output of said second microengine (60) is 

40 coupled to said input of said first microengine (58), 
for conditionally controlling and providing said input 
to each other of said first microengine (58) and 
second microengine (60). 

4. An instruction preparation unit according to 
45 claim 2, characterised in that: 

said first microengine means (58) receives an ad- 
dress via a multiplexer means (54) having a plural- 
ity of address source inputs including address 
sources from said register means (50) and said 
so second microgengine means (60). 

5. An instruction preparation unit according to 
claim 4, characterised in that: 

said multiplexer means comprises a logic selector 
(63) for selecting one of said plurality of address 
55 sources in accordance with control signals received 
from said first microengine (58), said second 
microengine (60) and said opcode portion of said 
instruction word. 
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6 . An instruction preparation unit according to 
claim 1, characterised." *!f rises a corr trat 
said: first rrncraengme (58). compnses 
stare mearrs (57) and (57> is 

=555SSsr--=a- 

claim 1. characterised in that confrol 
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© Instruction decoding microengines. 



© A pipeline architecture computer (24) has two 
interconnected microengines (58,60) operating si- 
multaneously in an instruction preparation unit (10). 
Each microengine (58.60) operates on the same 
portion (53) or different portions (53.55) of an in- 
struction concurrently within the same clock cycle. 
To assure coordination, one microengine (60) serves 
as the executive or attribute generator and the other 
the main microengine (58), The main microengine 
(58) generates routines to control the instruction 
preparation unit (10). The executive microengine (60) 
prepares the next task in parallel and supplies ar- 
guments to the main microengine (58) as the main 
microengine (58) generates its current routine. At the 
completion of a routine performed by the main 
microengine (58). it signals the executive microen- 
M gine (60) to obtain the next task and associated 
^parameters. The instruction preparation unit (10) thus 
translates variable length instruction word (53,55) 
40 from an instruction memory (16) of the pipeline 
£ architecture computer (24) into a local computer 
instruction (68) in the form of elementary opcode 
operations and final address values for data fetching 
<^and operation execution. 
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1. Claims 1-8,10-12: Instruction preparation 

2. Claim 9: Operand handler 
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